Slim-o-matic: a Semi-Automated Way to Generate Gene Ontology Slims
نویسندگان
چکیده
The Gene Ontology (GO) currently contains over 40,000 terms describing the locations, activities and processes of gene products. Several millions of gene products have been annotated using the GO, and these annotations are routinely used for multiple applications. However, because of the di↵erence of granularity in the annotations, it is useful to summarize GO annotations using GO slims. GO slims contain a subset of the GO terms, providing a higher-level, broader overview of the ontology while abstracting the finer details. Compiling GO slims is a time consuming process relying on manual human expertise, the process of creating the slims is often poorly documented, and maintaining and updating them can be di cult. In this paper, we present a semi-automated way to generate GO slims based on the annotation data available. We applied the tool to two di↵erent use cases, one for data overview in the newly released EBI Metagenomics pipeline, and one for gene-disease enrichment analysis using the DisGeNET platform. The slim-o-matic tool supports choosing the best terms for the slim, ensuring they are representative of the dataset, and have the best coverage using the minimal number of terms.
منابع مشابه
SLiMScape 3.x: a Cytoscape 3 app for discovery of Short Linear Motifs in protein interaction networks
Short linear motifs (SLiMs) are small protein sequence patterns that mediate a large number of critical protein-protein interactions, involved in processes such as complex formation, signal transduction, localisation and stabilisation. SLiMs show rapid evolutionary dynamics and are frequently the targets of molecular mimicry by pathogens. Identifying enriched sequence patterns due to convergent...
متن کاملSLiMSearch 2.0: biological context for short linear motifs in proteins
Short, linear motifs (SLiMs) play a critical role in many biological processes. The SLiMSearch 2.0 (Short, Linear Motif Search) web server allows researchers to identify occurrences of a user-defined SLiM in a proteome, using conservation and protein disorder context statistics to rank occurrences. User-friendly output and visualizations of motif context allow the user to quickly gain insight i...
متن کاملHH-MOTiF: de novo detection of short linear motifs in proteins by Hidden Markov Model comparisons
Short linear motifs (SLiMs) in proteins are self-sufficient functional sequences that specify interaction sites for other molecules and thus mediate a multitude of functions. Computational, as well as experimental biological research would significantly benefit, if SLiMs in proteins could be correctly predicted de novo with high sensitivity. However, de novo SLiM prediction is a difficult compu...
متن کاملA Comparative Study of Short Linear Motif Compositions of the Influenza A Virus Ribonucleoproteins
Protein-protein interactions through short linear motifs (SLiMs) are an emerging concept that is different from interactions between globular domains. The SLiMs encode a functional interaction interface in a short (three to ten residues) poorly conserved sequence. This characteristic makes them much more likely to arise/disappear spontaneously via mutations, and they may be more evolutionarily ...
متن کاملThe identification of short linear motif-mediated interfaces within the human interactome
MOTIVATION Eukaryotic proteins are highly modular, containing multiple interaction interfaces that mediate binding to a network of regulators and effectors. Recent advances in high-throughput proteomics have rapidly expanded the number of known protein-protein interactions (PPIs); however, the molecular basis for the majority of these interactions remains to be elucidated. There has been a grow...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016